A Filter-and-Refine Indexing Method for Fast Similarity Search in Millions of Music Tracks
نویسندگان
چکیده
We present a filter-and-refine method to speed up acoustic audio similarity queries which use the Kullback-Leibler divergence as similarity measure. The proposed method rescales the divergence and uses a modified FastMap [1] implementation to accelerate nearest-neighbor queries. The search for similar music pieces is accelerated by a factor of 10−30 compared to a linear scan but still offers high recall values (relative to a linear scan) of 95− 99%. We show how the proposed method can be used to query several million songs for their acoustic neighbors very fast while producing almost the same results that a linear scan over the whole database would return. We present a working prototype implementation which is able to process similarity queries on a 2.5 million songs collection in about half a second on a standard CPU.
منابع مشابه
Indexing Content-Based Music Similarity Models for Fast Retrieval in Massive Databases
This thesis develops a large-scale music recommendation system. To achieve this goal we solve three problems preventing the currently top-performing class of content-based music similarity algorithms from being used as recommendation engine in huge databases with millions of songs: First, we show how to correctly use the non-vectorial music similarity features with their nonmetric divergences i...
متن کاملMidiFind: Fast and Effective Similarity Searching in Large MIDI databases
How can we quickly search millions of MIDI files for similar pieces of music? Our MidiFind system focuses on exactly this problem. It has the following desirable properties: (a) it is effective: thanks to our novel features and corresponding similarity measurements, it outperforms traditional competitors, in terms of precision and recall; (b) it is scalable: thanks to our hybrid (MF) indexing s...
متن کاملPrototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica
Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...
متن کاملClassification of Iranian Traditional Music Dastgahs Using Features Based on Pitch Frequency
The Iranian traditional music is composed of seven majors Dastgahs: Chahargah, Homayoun, Mahour, Segah, Shour, Nava, and Rast-Panjgah. In this paper, a new algorithm for the classification of the Iranian traditional music Dastgahs based on pitch frequency is proposed. In this algorithm, the features of Lagrange coefficients of pitch logarithm (LCPL), Fuzzy similarity sets type 2 (FSST2), and th...
متن کاملAugmenting Text-based Music Retrieval with Audio Similarity: Advantages and Limitations
We investigate an approach to a music search engine that indexes music pieces based on relatedWeb documents. This allows for searching for relevant music pieces by issuing descriptive textual queries. In this paper, we examine the effects of incorporating audio-based similarity into the text-based ranking process – either by directly modifying the retrieval process or by performing post-hoc aud...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009